Virus Labs & Distribution
VLAD #4 - Win Infection


		
		Windows Executable Infection 
			   by
		  Qark and Quantum [VLAD]


 This document attempts to explain technically NewExe infection for the
 virus writer.  This isn't for the novice coder and you'll need to
 understand DOS EXE infection before being able to use any of it.

 The infection described in detail here is the same as used by the
 accompanying WinSurfer virus, there are other methods that can be done 
 but they aren't our concern (it's up to you to develop new code!)

 You will want a copy of the New Exe header information which is available
 in Ralf Browns interrupt listings under Int21h AH=4Bh, a copy of which is
 included at the end of this article.

 This is a map of what we are trying to do:
 
       Before infection                  After Infection

     0h---------------------           0h---------------------   
       |                   |             |                   |
       |  DOS EXE header   |             |  DOS EXE header   |
       |                   |             |                   |
       ---------------------             ---------------------
       |                   |             |                   |
       |  DOS Code         |             |  DOS Code         |
       |                   |             |                   |
       |                   |         3F8h---------------------
   400h---------------------             |                   |
       |                   | <- Move     |  New EXE Header   |
       |  New EXE Header   |     this    |  and some tables  |
       |  and some tables  |      up     |                   |
       |                   |             | ----------------- |
       | ----------------- |             |   Segment         |
       |   Segment         |             |   Table           |
       |   Table           |             |                   |
       |                   |             |                   |
       |                   |   Insert -> | ----------------- | Virus segment
       | ----------------- |    this     | ----------------- | entry
       |   Misc Other      |             |   Misc Other      |
       |   Tables          |             |   Tables          |
       |                   |             |                   |
       x                   x             x                   x
       x                   x             x                   x
       x                   x             x                   x
 CS:IP ---------------------             ---------------------
       |                   |             |                   |
       |   Windows Code    |             |   Windows Code    |
       |   and Misc        |             |   and Misc        |
       |   Segments        |             |   Segments        |
       |                   |             |                   |
   End ---------------------       CS:IP ---------------------
					 | Virus segment     |
			     Append all  | with code etc     |
				this ->  |                   |
					 --------------------- Virus
					 |                   | Relocation
				     End --------------------- Entry
 

 Infection Theory:
 -----------------
 Test the EXE to make sure its windows and that the NE is at offset 1024.
 Reduce the NE pointer at 3ch by eight.
 Reduce DOS SP by eight.
 Any NE offsets which are larger than the segment table offset must be
  increased by eight.
 Increment the segment counter.
 Save the CS:IP.
 Point the CS:IP to the new segment entry
 Calculate the position of the end of the segment table, move all data up to
  and including the segment table up by eight bytes.
 Add another segment entry
 Write the virus to the end of the file
 Write relocation entry to the end of the file.
 ---

 That is a very basic overview of what is wanted.

 The main idea of it all is moving the NE header forward to add a new
 segment table entry, and changing the CS:IP to point to it.

 We'll go through all the stages step by step.

 Testing for a Windows executable.
 ---------------------------------

 A NewEXE file always begins with a DOS header and some DOS executable
 code.  With a windows executable:
	The word at offset 0 is 'MZ'.
	The word at offset 18h is 40h or greater.
 assuming that these conditions are met then we also want the pointer to the
 NE header (3ch) to be equal to 400h.  The reason for this is that room is 
 needed to move the NE header forward by 8.

 Rewriting the DOS header.
 -------------------------

 Before rewriting the DOS header, reduce the DOS SP field (10h) by 8, because
 if it was pointing to the end of the DOS code section it would point to
 the start of the NE header. (No real reason for doing it really)
 The NE header pointer at 3ch must be reduced by 8, because we intend on
 moving most of the NE header forward to that position.
 
 Modifications to the NE header.
 -------------------------------

 From this point onwards we are referring to the offsets in relation to the
 NE header.

 The pointer to the segment table is a word at 22h.  If any of the pointers
 at 4,24h,26h,28h,2ah are larger than [22h], then increase that
 pointer by eight.  The reason for this is that since we are inserting a
 new segment table entry, all tables behind the segment table will be
 eight bytes further from the start of the header.

 The segment counter is a word at offset 1ch.  Increment it by one because
 since we are adding another segment, we need to indicate this in the
 header.

 At offset 14h is the dword pointer to the program entry point, CS:IP
 where the CS is actually the segment table entry number.  Save this pointer
 for use in your relocation entry (that part will be explained in detail
 later on).  Now point the CS:IP instead to the virus segment.  Do this
 by writing the necessary starting IP value to 14h (offset of the entry into
 your segment) and the segment counter value into 16h (since the virus
 segment is the last in the segment table it will be equal to the segment
 counter value which we have incremented).

 You can work this out for yourself, but what you need to do now is move
 the entire NE header, upto and including the segment table, but not
 the data behind it, forward by eight bytes.  Heres an equation on
 calculating how much to move:
	((segmentcounter-1)*8)+word ptr [22h]

 Directly after this position is where to insert the new virus segment entry.

 The Virus Segment Entry.
 ------------------------

 Format of new executable segment table record:
  00h    WORD    offset in file (shift left by alignment shift for byte offs)
  02h    WORD    length of image in file (0000h = 64K)
  04h    WORD    segment attributes (see below)
  06h    WORD    number of bytes to allocate for segment (0000h = 64K)

 Offsets 2 and 6 in the segment record are just the virus size. Offset
 4 is the segment attribute, we use 180h, which makes the segment executable
 with relocations.

 Calculating offset 0 is more difficult.  It is necessary to get the file
 length and shift it right by the alignment shift value which was saved
 earlier.  I wouldn't know how to do a dword shift so I convert it to a
 division.

 File length into DX:AX :
	mov     ax,4202h
	xor     cx,cx
	cwd
	int     21h
 Shift right DX:AX by alignment shift:
	mov     cl,byte ptr alignment_shift
	push    bx
	mov     bx,1
	shl     bx,cl
	mov     cx,bx
	pop     bx
	div     cx
     AX=required offset 0 value

 Write this eight byte table behind all the moved header.

 Writing the Virus.
 ------------------

 Lseek to the end of the executable and write the virus.

 Writing the Relocation Entry.
 -----------------------------

 You need a relocation entry so that you can return control to the host
 after the virus has done its dirty work.  The relocation entries follow
 the segment itself.

 Format of relocation entry:
 Offset  Size    Description
  00     WORD    Number of relocation entries
	 Offset  Size    Description
	  00     BYTE    relocation type
	  01     BYTE    relocation flags
	  02     WORD    offset within segment
	  04     WORD    target address segment
	  06     WORD    target address offset


 The first word will be 1 because only 1 relocation entry.
 First byte is 3 to signal a 32bit pointer (a far jump).
 Second byte is 4 to signal additive relocation. (doesn't work without this)
 Word at offset 2 is the offset of the dword after the 'far jump' opcode.
 Word at offset 4 is the CS of the original host CS:IP
 Word at offset 6 is the IP of the original host CS:IP

 To put all this into code:

		db      0eah    ;Opcode of JMP FAR PTR
    ret_ip      dw      0
		dw      0ffffh

    relocation  dw      1       ;One relocation entry
		db      3       ;32bit pointer relocation
		db      4       ;Additive relocation
		dw      offset ret_ip   ;Offset of the relocation in segment
    hostcs      dw      0       ;CS:IP of place we want our relocation to
    hostip      dw      0       ; point.

 Note: the relocation address data (ret_ip) _must_ be FFFF:0000 as above
       because windows treats it as a linked list.  If you don't understand
       don't worry, just do it!  It won't work otherwise.  It is important
       that before writing the virus body to the executable that you set
       this address to FFFF:0000 so that executable will be setup correctly.

 Write the relocation entry to the end of the file.

 Windows allows a standard Int 21h call just fine so all file manipulation
 is as simple as ever, with a few minor changes.


 Writting TSR/ISR's under Windows.
 ---------------------------------

 Windows is a protected mode environment and thus to set a vector under it we
 must manipulate the various tables that protected mode requires to be
 updated.  Of these tables one is of interest to us.  The "Interrupt
 Descriptor Table" (IDT). The IDT holds all the Protected Mode interrupt
 vectors that are in the "Interrupt Vector Table" (IVT) - if they are
 supported - and some more.  This is all well and good - all we have to do is
 set a vector in the IDT to point to our code like we do with dos and the 
 IVT but - there are complications.

 V86 mode.
 ---------

 When Windows is running in 386 enhanced mode the processor is locked into
 v86 mode.  In v86 mode each application has its own 1mb memory block.  All
 EMS/XMS is locked away from the application and (usually) each process has
 its own IDT/IVT.  The idea being to allow the application to think it is
 alone on the system.  Any attempt to access any other tasks that may be
 running is a "violation of system integrity" and will cause an exception.
 This is no good for a virus and by now you're thinking we're screwed but
 windows solves the problem for us.  Windows has but one IDT and of course
 only one IVT to shadow it and thus if we are to a hook a vector in the IDT
 it will be reflected in every task.  (Ever seen that OS/2 warp advert where
 they say "... and true multi-tasking" - well this simply means that OS/2
 warp holds a seperate IDT for every task under v86 mode.)

 Setting the Vector.
 -------------------

 So let's get this straight.. the IVT is the one at 0:0 that we all know and
 love and the IDT is a protected mode/v86 shadow of it.  So you're most
 probably asking "why can't we just hook the IVT and let windows reflect it
 into the IDT?" - well there's 2 reasons: firstly windows doesn't "reflect"
 the IVT as we would like to think, actually it copies the IVT into the IDT
 on startup then replaces certain routines with "protected mode call
 structures" which call the real mode routines and some routines it doesn't
 even do this, secondly trying to access the IVT directly is a big nono and
 will cause an exception error.
 
 Can we use DOS ?  well yes, you can.. most dos functions have been reflected
 up into windows although windows is supposed to be trying to get rid of
 them. So you can call int 21,25/35 to set the vector but remember that this
 will be setting a vector in the GVT and only at the discretion of windows.

 A better way: use DPMI.  Windows is NOT your host under protected mode or
 even v86 mode as microsoft would have you believe.  Windows has an
 overlooker that provides windows time-slicing and exception abilities.  In
 fact all windows does is act as a mediator to DPMI and basically slow things
 down.  So how do we use DPMI to set the vector ?  well.. by using functions
 0204h/0205h  - with the vector in BL  - of the DPMI provider int 31.  This
 way windows has no say over what goes in its IDT and thus we have a LOT of
 control.

 Writing the Interrupt.
 ----------------------

 This is perhaps one of the easier things to do - once you understand the
 principles.  In protected mode/v86 mode your code segment is supposed to be
 read only and thus you're supposed to have a code, stack and data segment.
 Everyone knows this is bad so what we need is a way to write to the code
 segment. You won't find one - writing to the code segment just isn't
 possible.  BUT - if we were to have a data segment that pointed to the same
 code segment then we could do it. (actually we don't have any segments in
 protect mode.. we have a thing called a "selector" which gives access
 parameters to areas of memory.)

 To get an "alias selector" to the code segment we can use another DPMI
 function and bypass windows altogether: function ax=000ah with the code
 selector in bx of int 31h, it returns a read/writeable alias selector to
 the code segment in ax.

 So now Protected mode isn't protected and we're free to write to the code
 segment. One problem arises from this idea.  Qark wanted to put this code
 to generate the alias in the loader part of the virus and store it in the
 code segment for the interrupt routine.  This seemed fine until I found out
 that windows likes to move segments around.  One minute your code might be
 at one address the next it may be at another.  Thus you should always
 regenerate the alias on every execution of the ISR or lock down the segment
 - we chose against this last approach as it can cause exception errors from
 windows (DPMI has no problems with this strategy.)

 For a detailed look at DPMI consult Ralf Brown's interrupt listing under
 int 31h.  These functions haven't been included in this text.

 Staying Resident.
 -----------------

 Finally: don't go resident off something that can be closed.  If you go
 resident off something like mine sweeper or something and the user closes
 the application, your code segment will be deleted and removed from the IDT
 and will point to an empty segment.  This will cause windows to hang.
 Solution: look for a process that will never be closed, direct action 
 infect this as soon as you find it and only ever go resident off it.

 In the 'system.ini' file in your windows directory, is a variable 'shell='
 that points to a file that is never closed.  Mostly it will be
 'progman.exe' but sometimes other programs are used, so read it in and
 infect it.

 To find the path of the windows directory, search the environment for a
 variable called 'windir=' (yes, lowercase!).  On entry to any windows
 executable ES will point to the PSP (or something functionally similar).
 The word at 2ch is the segment (selector) of the environment segment.
 Scan through this as you would within a DOS application.  We didn't discover
 this through any documentation, but when Quantum accidentally typed 'set'
 from within a DOS window.

 Facts and Possible Techniques.
 ------------------------------

 At offset 0eh in the New Executable header is the 'auto data segment index'.
 This is another segment table index, which will be automatically in DS
 on execution entry.  Although this is pure guesswork, if you set that
 'auto data segment' to the same segment as CS, (ie in every way the same
 except the segment attributes) then you could write to your CS without
 using DPMI and would also open the gateway to polymorphic windows viruses.
 (Polymorphism would be pretty useless if you had to put a DPMI call before
 you could write to your code segment.)

 Mark Ludwig's 'Windows Virus #2' uses a different form of infection in that
 it extends the length of the code segment as indicated in the segment
 table, and moves the entire host program starting from the end of the file
 back by virus length until the end of the code segment is reached, at which
 point the virus is written.  The IP is then repointed to the virus.
 There are some advantages to this, such as no need to add relocation entries
 because the jmp is intra-segment but this is greatly outweighed by the
 fact that moving an entire file is very slow and a much larger number of
 pointers in the header need to be modified to account for the change.

 There are DPMI calls for allocating memory (Int 31h ax=501h) but it is
 unknown at this point whether the memory remains allocated after the program
 is closed.  Mark Ludwig makes extensive use of these calls for temporary
 buffers in his direct action 'Windows Virus #2'.
 
 Mark Ludwig also demonstrates use of the WinAPI calls, which involve adding
 a new relocation entry for each call and pushing some stuff on the stack.
 This may become important when win95 comes out but for the moment it is
 more convenient to simply use int 21h.

 Some food for thought.
 ----------------------

 Microsoft has a dream [here we go], the dream is to control the world.  They
 intend to do this with applications like Windows [scary isn't it ?], and by
 the looks of it they might succeed.  A lot of people think computers should
 be easy, that there should be no skill involved in it and that everyone
 should HAVE to use one.  Microsoft are using this idea to flood the market
 with shit like Windows, in the hope that it will be accepted as a "standard".
 Windows is already this but Microsoft aren't happy with one success, they
 want to be on top of the hill.

 This is where we come in.  If Microsoft is going to succeed in taking over
 the world then we should be there making their lives difficult.  The
 WinSurfer virus created by us is so stable that an average windows
 user wouldn't even notice it.  [Then again the average windows user
 wouldn't notice if his hair caught on fire.]  And thus will stay with us
 for quite some time.  At least until AVers recognise the windows market
 and start releasing scanners.




 The New Executable format.     (From Ralf Browns Interrupt listing 21 AH=4B)
 --------------------------

 new executables begin running with the following register values
	AX = environment segment        (Wrong! AX=0)
	BX = offset of command tail in environment segment
	CX = size of automatic data segment (0000h = 64K)
	ES,BP = 0000h                   (Wrong! ES=PSP)
	DS = automatic data segment
	SS:SP = initial stack
  the command tail corresponds to an old executable's PSP:0081h and
  following, except that the 0Dh is turned into a NUL (00h); new
  format executables have no PSP  (All wrong)

Format of .EXE file header:
Offset  Size    Description
 00h  2 BYTEs   .EXE signature, either "MZ" or "ZM" (5A4Dh or 4D5Ah)
 02h    WORD    number of bytes in last 512-byte page of executable
 04h    WORD    total number of 512-byte pages in executable (includes any
		partial last page)
 06h    WORD    number of relocation entries
 08h    WORD    header size in paragraphs
 0Ah    WORD    minimum paragraphs of memory to allocation in addition to
		executable's size
 0Ch    WORD    maxi paragraphs to allocate in addition to executable's size
 0Eh    WORD    initial SS relative to start of executable
 10h    WORD    initial SP
 12h    WORD    checksum (one's complement of sum of all words in executable)
 14h    DWORD   initial CS:IP relative to start of executable
 18h    WORD    offset within header of relocation table
		40h or greater for new-format (NE,LE,LX,PE,etc.) executable
 1Ah    WORD    overlay number (normally 0000h = main program)
---new executable---
 1Ch  4 BYTEs   ???
 20h    WORD    behavior bits
 22h 26 BYTEs   reserved for additional behavior info
 3Ch    DWORD   offset of new executable (NE,LE,etc) header within disk file,
		or 00000000h if plain MZ executable

Format of new executable header:
Offset  Size    Description
 00h  2 BYTEs   "NE" (4Eh 45h) signature
 02h  2 BYTEs   linker version (major, then minor)
 04h    WORD    offset from start of this header to entry table (see below)
 06h    WORD    length of entry table in bytes
 08h    DWORD   file load CRC (0 in Borland's TPW)
 0Ch    BYTE    program flags (see below)
 0Dh    BYTE    application flags (see below)
 0Eh    WORD    auto data segment index
 10h    WORD    initial local heap size
 12h    WORD    initial stack size (added to data seg, 0000h if SS <> DS)
 14h    DWORD   program entry point (CS:IP), "CS" is index into segment table
 18h    DWORD   initial stack pointer (SS:SP), "SS" is segment index
		if SS=automatic data segment and SP=0000h, the stack pointer
		is set to the top of the automatic data segment, just below
		the local heap
 1Ch    WORD    segment count
 1Eh    WORD    module reference count
 20h    WORD    length of nonresident names table in bytes
 22h    WORD    offset from start of this header to segment table (see below)
 24h    WORD    offset from start of this header to resource table
 26h    WORD    offset from start of this header to resident names table
 28h    WORD    offset from start of this header to module reference table
 2Ah    WORD    offset from start of this header to imported names table
		(array of counted strings, terminated with a string of length
		  00h)
 2Ch    DWORD   offset from start of file to nonresident names table
 30h    WORD    count of moveable entry point listed in entry table
 32h    WORD    file alignment size shift count
		0 is equivalent to 9 (default 512-byte pages)
 34h    WORD    number of resource table entries
 36h    BYTE    target operating system
		00h unknown
		01h OS/2
		02h Windows
		03h European MS-DOS 4.x
		04h Windows 386
		05h BOSS (Borland Operating System Services)
 37h    BYTE    other EXE flags
		bit 0: supports long filenames
		bit 1: 2.X protected mode
		bit 2: 2.X proportional font
		bit 3: gangload area
 38h    WORD    offset to return thunks or start of gangload area
 3Ah    WORD    offset to segment reference thunks or length of gangload area
 3Ch    WORD    minimum code swap area size
 3Eh  2 BYTEs   expected Windows version (minor version first)
Note:   this header is documented in detail in the Windows 3.1 SDK
	Programmer's Reference, Vol 4.

Bitfields for new executable program flags:
Bit(s)  Description
 0-1    DGROUP type
	  0 = none
	  1 = single shared
	  2 = multiple (unshared)
	  3 = (null)
 2      global initialization
 3      protected mode only
 4      8086 instructions
 5      80286 instructions
 6      80386 instructions
 7      80x87 instructions

Bitfields for new executable application flags:
Bit(s)  Description
 0-2    application type
	001 full screen (not aware of Windows/P.M. API)
	010 compatible with Windows/P.M. API
	011 uses Windows/P.M. API
 3      is a Family Application (OS/2)
 5      0=executable, 1=errors in image
 6      non-conforming program (valid stack is not maintained)
 7      DLL or driver rather than application
	(SS:SP info invalid, CS:IP points at FAR init routine called with
	  AX=module handle which returns AX=0000h on failure, AX nonzero on
	  successful initialization)

Format of new executable segment table record:
 00h    WORD    offset in file (shift left by align shift to get byte offs)
 02h    WORD    length of image in file (0000h = 64K)
 04h    WORD    segment attributes (see below)
 06h    WORD    number of bytes to allocate for segment (0000h = 64K)
Note:   the first segment table entry is entry number 1

Bitfields for segment attributes:
Bit(s)  Description
 0      data segment rather than code segment
 1      unused???
 2      real mode
 3      iterated
 4      movable
 5      sharable
 6      preloaded rather than demand-loaded
 7      execute-only (code) or read-only (data)
 8      relocations (directly following code for this segment)
 9      debug info present
 10,11  80286 DPL bits
 12     discardable
 13-15  discard priority

Format of new executable relocation data (immediately follows segment image):
Offset  Size    Description
 00h    WORD    number of relocation items
 02h 8N BYTEs   relocation items
		Offset  Size    Description
		 00h    BYTE    relocation type
				00h LOBYTE
				02h BASE
				03h PTR
				05h OFFS
				0Bh PTR48
				0Dh OFFS32
		 01h    BYTE    flags
				bit 2: additive
		 02h    WORD    offset within segment
		 04h    WORD    target address segment
		 06h    WORD    target address offset



		[end]


- VLAD #4 INDEX -
ARTICLE.0_0       Hidden Area Story By QuantumG

ARTICLE.1_1      

Introduction
ARTICLE.1_2       Aims and Policies
ARTICLE.1_3       Greets
ARTICLE.1_4       Members/Joining
ARTICLE.1_5       Dist/Contact Info
ARTICLE.1_6       Hidden Area Info
ARTICLE.1_7       Coding the Mag

ARTICLE.2_1      

Tax Office
ARTICLE.2_2       Fight Back!
ARTICLE.2_3       Interviews
ARTICLE.2_4       Cryptanalysis
ARTICLE.2_5       Slovakia
ARTICLE.2_6       TBMem Flaws
ARTICLE.2_7       F-Prot Troubles

ARTICLE.3_1      

Win Infection
ARTICLE.3_2       WinVir14 Disasm
ARTICLE.3_3       Andropinis
ARTICLE.3_4       Super Virus-2
ARTICLE.3_5       VTBoot
ARTICLE.3_6       Ebbelwoi VQ7
ARTICLE.3_7       Unix Viruses

ARTICLE.4_1      

Virus Descriptions
ARTICLE.4_2       Ender Wiggin
ARTICLE.4_3       WinSurfer
ARTICLE.4_4       Antipode 2.0
ARTICLE.4_5       Bane
ARTICLE.4_6       RHINCE
ARTICLE.4_7       Tasha Yar

ARTICLE.5_1      

Replicator
ARTICLE.5_2       ART v2.2
ARTICLE.5_3       Good Times!
ARTICLE.5_4       DOS Idle
ARTICLE.5_5       Neither
ARTICLE.5_6       Virus Scripts
ARTICLE.5_7       What's Next ?

About VLAD - Links - Contact Us - Main